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ABSTRACT 

college instructors evaluted the quality cf their own 
teaching and vere evaluated by their students in 331 different 
courses. Student evaluations of teaching correlated vith instructor 
self evaluations in courses taught by teaching assistants, in 
undergraduate courses taught by faculty, and in graduate level 
courses, demonstrating their validity at all levels of university 
teaching. Both student and instructor ratings vere reliable, and 
separate factor analyses indicated that the saie nine evaluation 
factors influenced both sets of ratings: learning/valae; instructor 
enthusiast; organization; group interaction; individual rapport; 
breadth of coverage; exailnations/grading; value of assignients; and 
vorkload/diff iculty. Student- instructor agreeient on each factor was 
independent of the factor's agreewent vith other factors. Hhile 
correlations between student and instructor ratings on the saie 
factors vere high, correlations bctveen their ratings on different 
factors were lov. This finding supports the distinctiveness of the 
different factors, as veil as the use of eultifactor evaluation 
instrusents developed vith the use of factor analytic techniques. 
These findings establish the validity and accuracy of student 
evaluations at all levels of university teaching, and suggest tha 
possible usefulness of instructor self evaluations. (Questionnaires 
are ai^pended.) (Author/HH) 
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UJ ABSTRACT * 

Instructors evaluated the quality of their own teaching and were evaluated by 
their students in each of 331 different courses. Student evaluations of teaching 
correlated with instructor self evaluations In courses taught by teaching 
assistants (ra.46), in undergraduate courses taught by faculty (r=.41), and even 
in graduate level courses (r=.39), demonstrating their validity at all levels 
of university teaching. Both student and instructor ratings were reliable, and 
separate factor analyses Indicated that the same nine evaluation factors (learning/ 
value, organization, enthusiasm, etc.) underlay both sets of ratings. Furthermore', 
student-instructor agreement on each factor was independent of its agreement on 
other factors. While correlations between student and instructor ratings 
on the same factors were high (median r=.45) correlations between thftir ratings 
on different factors was low (median r-.OO). This argues for the distinctiveness 
of the different factors and for the use of multi factor evalua*;ion instruments 
that have been developed with the use of factor analytic techniques. These findings 
establish the validity of student evaluations at all levels of university teaching, 
suggest the possible usefulness of Instructor self evaluations, and will help 
reassure faculty about the accuracy of the student ratings. 
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Validity of Students' Evaluations of Teaching: A Comparison 
with Instructor Self Evaluations by Teaching Assistants, 
Undergraduate Faculty and Graduate Faculty 

Common criticisms of students' evaluations are that they are biased by 
variables unrelated to teaching effectiveness and that they lack validity. 
However, researchers have reported considerable empirical evidence Indicating 
that most background vairlables. Including cUss size, reason for taking the 
course, workload, and grade point average, -jre not substantially related to 
student ratings (Marsh, 1978; Marsh, Overall & Thomas, 1976; McKeachle, 1973; 
Remners, 1963). In addition to this apparent lack of bias, student ratings have 
been validated against a variety of differert criteria. The most common 
criterion has been performance on a standardized examination; when different 
sections of the same courses are taught by different Instructors, the sections 
that do best on the standardized examination given to all sections are also the 
ones who evaluate their Instructors move favorably (Centra, 1977; Cohen & 
Berger, 1970; Frey, 1973; Marsh, Flelner & Thomas, 1975, Overall and Marsh, 
1978). Other researchers have successfully validated student ratings against 
the ratings of former students (Centra, 1973; Marsh, 1978; Marsh & Overall, 1979). 

Validity research such as that described above has generally been limited 
to a specialized setting or has employed criteria that are unlikely to 
convince skeptics. Thus, faculty will continue to question the usefulness 
of student ratings until validity criteria applicable across a wide range of 
classes is utilized. A criterion that meets this requirement--instructor 
slef evaluations of their own teaching--should also be acceptable to most 
faculty and administrators. Instructors can be evaluated and evaluate their 
own teaching in any instructional context, even graduate level coureses and 
courses taught by teaching assistants. Furthermore, instructors can be asked 
to evaluate their own teaching along the same dimensions employed in the student 
rating form, thereby testing the specific validity of the different rating factors. 

In spite of the apparent appeal of instructor self evaluations as a 
criterion for validating student ratings, relatively few studies have 
considered it. Centra (1973) found correlations of about .20 between faculty 
self evaluations and student ratings, but both sets of ratings were collected 
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atiBidtbrm \3 part of a lar79r project that exaainecl tha iipact Df feedback fcoi 

■Id^ter* cviluatlons. BUskburn and Clack also reported correlations of 

about .20, but th»»T only asked faculty to rate their teaching in a qeneral sense 

rather thin to rate thoir taachlnq in the specific class being evaluated by 

students. In contrast, hiqher correlations have been reported in three other 

investiqaUaas. Doyle and Crich ton ( 1978) found a median correlation of .47 

between the self ratings of teachinq assistants in 10 sections of a 

■ultlsection course and the corresponding student ratings. Webb and Nolan (1955) 

reporte-i i corrwlation of .62 in a nilitary setting in which instructors were 

not professional teachers. Marsh, Overall, and Kcsler(1979) asked regular 

faculty teashim unlergrii ua te courses to evaluate themselves on the saae 

ford that vis ur.eJ by thslr stuJents. nean differences between faculty and 

student ratings w«re small, and separate factor analyses revealed that the 

saiQ set Df evaluation factors underlay both sets of ratings. The aedian 

correlation between self-ratings and student ratings was .49. ' 

ThR Marsh, Overall and KBsler(197q) study served as a basis for the 
present one. This study, although a replication of the earlier 
research, differs in several important aspects. First, the evaluation 
instrument was expanded to include several new evaluation factors. Second, 
the sample size was increased to include 331 courses. Third, courses taught 
by teaching assistants and graduate level courses were included as well as 
undergrsdui te courses taught by faculty. 

rha present study has two purposes. First, it investigates the validity 
of student ratings for three instructional subgroups: courses taught by teaching 
assistants, undargraduata courses taught by regular faculty, and graduate level 
courses. Previous research has not considered the validity of the ratings 
in graduate leval cours?s. Second, as a consequence of the large number of 
coucses--a total ( f 331--this study permits a detailed application of the 
mult itrait-mult imethod procedure to test for both convergent and divergent 
validity. Convergent validity, which is typically considered, is based 
upon the correlation between student and faculty ratings on the saae evaluation 
fictor. 'lowever, even if ganeral convergence is demonstrated, this does not 
argue for the usefulness of the many different evaluation factors often 



e.pl0Ted, so.e users of stuient ovafSgf &Lf|^^tT^^JngIltr^tors, .nd 
researchecs--explicitly or lipUcitly assuie that aost of tbe useful inforaation 
U contained in a single overall ratinq itei or in a siiple average across a 
nusber of specific itens. This ignores the divergent or dissriainan t validity 
of the ratings. On the othar hand, the deaonstra t ion that studen t-iastructDc 
aqreeaent on any one diuansion is independent of aqceeaent on other diaesnions 
would deaoQstrate the utility of the distinct factors and arque for the 
uio of factor analytic techniques in the dovelopacnt of evaluation instruaonts. 

flBrHODOLOCy 

During the asadeaic year 1977-78 student evaluations were collected in 
virtually all courses of farad in the Division of Social Sciences at the 
University of Southern CTalifornia. Evaluations were adainisteced shortly 
bafDre the end of the teca, generally by a desiqnated student in the class 
or by staff person. Students were told that the evaluations would provide 
feedback to instructors and would be considered as part of personnel 
decisons. The surveys ware coapleted by an averaqe of 76% of the students 
enrolled in each class. 

The evaluation instruaent (See Appendix I) consisted of 35 evaluation 
iteaa adapted froa Hildabrand, Hilson e Dienst(1971) and Marsh, Dvorall e Thoaas 
(1976), The aedian reliability of indvidual evaluation iteas— intraclass 
correlat'on coefficients based upon sets of responses froa 25 students per 
class— *a3 .8a (See Appendix II). A factor analysis (Se9 Appendix III) of the 
student ratings of all undergraduate courses taught by regulaL faculty revealed 
nin« separate evaluation factors. The reliability of the factors, coefficient 
alphas, varied froa ,88 to .97 (See Appendix II). 

Instructor self ovaluation surveys were sent to all teachers who had been 
evaluated by students in at least two different courses during the saae tera. 
Tnstuctors were asked to evaluate the effectiveness of their own teaching in 
both courses. These surveys were coapleted after the end of the tera, but 
.before suaaaries of the student evaluations wore returned. Hhile participation 
was voluntary, a cover letter froa the Dean of the Division strongly encouraged 
cooperation and guaranteed the anonyaity of each teacher's response. 
Instructors evaluated both courses with a set of iteas identical to those used 
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INh'rnUCTOR SELF EVALHATIDNS U 
by itudentSf except that iteas were worded in the first person. They 
were specifically instructed to rate thoir own teaching effectiveness and not to 
report how students would rate then. A total of 181 (7BX) surveys were 
returnel. since faculty had been requested to rate the ef fectivRness of their 
teashinq in both classes they taught, self ratings for a total of 331 

courses w^re corQpleted--ratings of 183 undergraduate courses taught by faculty, 
US graduate level courses, and 103 courses taught by teichin? assistants. 

Eleven evaluation scores-- factor scores representing the nine evaluation 
factors and overall ratings of the teacher and the course were used to suiiarizt* 
the stulent ratings and the instructor self ratings. Evaluation factor 
scores were weighted averages of standardized responses to each evaluation item. 
The wei^ihts, factor FiCouh coefficients, were derived froa the factor analysis 
described in Appendix III. 

In addition to actual evaluation of thoir own teaching, faculty were 
asked to express their igreanent or disagreement with statements about 
student evaluations and other methods of evaluating the effectiveness of their 
teaching. Faculty also rated themselves and the course they taught on selected 
background variables that have been suggested as potential biases to the student 
ratings (e.g., their "qraling leniency", their "popularity with students", 
their perceptions of their students* subiect interest before the start of the 
course, etc.). Attitudes and variables faculty felt were likely to bias 
the student ratings are presented in Appendix VX; the relationship 
between both student and faculty ratings and potentially biasing variables 
are presented in Appendix YII. 

RESULTS 

Faculty Attitudes Toward Student Ratings. 

As part of the study, faculty were asked to express their agreement or 
disagreement with statements concerning student ratings, potential biases in 
student r9.tings, and other oossible methods of evaluating the guality of their 
teachinq (See Table 1). A majori t y (59X) of the faculty indica>.ed that some 
measure of teaching quality should be given more emphasis in promotional 
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TNrrHUCTOR SELF EVALUATIONS 5 
aeclHions. Faculty clearly aqroed that .student ratings vera useful to the 
faculty theasolves as fotidback, and a lalority even agreed that they shauld 
bo aade publicly available for students to use in courss seloctiDn. However, 
they were lore skeptical about the accuracy of the student ratings. 
Furthernore, fasultN were oven more critical about using classrooi visitation 
by peers or faculty self a valuations in proaotional decisions, though they were 
Roaewhat noce favorable towards the use of colleague exaaination of course 
outlines, reading lists, cAassrooa exaainations, etc. 

Insect Table 1 About Here 

Faculty were also asked to indicate the iteas in a list of "potential 
biases" that thwy believed would actually cause a substantial bias. The aast 
freguently aentioned were: Course Difficulty {72X) , Grading Lenle ncy— lots of 
A«3" (6Bt) , Instructor Popularity (63%), and Student Interest in Subject Before 
Taking :our^e (625). it was interesting to note, however, that faculty self 
evaluations of their own teaching and student evalutions of the faculty were: 
1) both pasitively related to Workload/Difficulty (harder coursos were evaluated 
Bore fa5rocably by both), 2) both positively related to faculty self ratings of 
their "popularity with students", 3) both positively related to student 
prior subject interest, and U) both uncorrelated with faculty self ratings of 
their "grading leniency". These findings suggest that threa of these variables 
— workload, prior subie::t interest, and instructor popularity — are variables 
actually relate! to quality of teaching, since each shows siailar 
relationships to two different aeasures of teaching quality. The fourth ^ * 
variable, grading leniency, is apparently unrelated to either guality of 
teaching or student ratings of quality of teaching. 

A uileaaa clearly exists. Faculty are concerned about teaching 
effectiveness, even to the extent of wanting it to play a aore inportant 
role in their own proaotions. However, many expressed doubts about any 
' of the possible aeasures of teac*>ing effectiveness that were suggested- 
including student evaluations. in particular, faculty suggested a nuaber 
of sources of potential bias in the student ratings, even though each of 
these potential biases showed siailar relationships to student and faculty 
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IMSTROCTOR SELF EVALUATIONS 6 
ratings of teaching effect ivenss. Before the potential usefulness of student 
ratings can be realizet^, faculty and adainistrators hav3 to be convinced 
that student ratings are valid and relatively free of bias. 

Factor Analysis 

Sepirate fastor inalyses were performed on student and instructor self 
ratings of the .15 evaluation Itans (See Table 2), to detemine if the evaluatio 
factors underlying student evaluations were siailar to those representing 
instructor self evaluations. Both confiraed the nine evaluation 
factors that had previously been identified. Each itet, for both 
stuJent and instrustor ratings, loaded highest on the fictor it was 
deslgnei to aeasura. loiJinqs for itess defining each factor were generally 
at least .HO ani usually exceeded .50. All other loadings were less 
than .30 and were usually less than .20. The slallarlty In the two factor 
patterns laplles that slillar diaensions underlie both studant and instructor 
ratings of effective teaching. The results of both factor analyses were also 
gulte similar to rasults of a previous factor analysis perforaed on the 
student ratings of all undergraduate courses taught by faculty (See Appendix 
IT) — Including those considered in this study. 

InsGrt Table 2 About Her? 

Several analytic techniques are available for tha coaparlson of dlfferen' 
factor analyses, but none have been thoroughly developei (Levlne, 1977). 
Target analysis, the rotation of one structure to fit the structure of another. 
Is better suite:! to aatchlng one eaplrlcal structure to a second theoretical 
structure. Purtheraore, It forces data— while capitalizing on chance— to fit 
the proposed aodel, or as suggested by Hurley and Cattell (1962), the 
procedure "lends Itself to the brutal feat of making alaost any data fit 
alaost any hypothesis." An alternative procedure developed by Cattell and 
Baggaley(1960) , the sallant variable slallarlty Index, classifies loadings Into 
those that are higher than an arbltrarllly defined substantial loading and 
those that are not. This procedure Ignores auch of the Inforaatlon In the 
loadings by converting thea into dlchotoales. Thus, at least In this appli- 
cation, careful selection of the "substantial" cutoff would result In "perfect" 
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fltj foe nil factor patterns. Lftvine ( 1977) , aaouq other procedures, suqijests 
.sUply cocrelatlny the the factor loadings, m the present application, each 
factor pattern rsee Table 2) had 315 factor loadinqs; eich of 35 iteas had 
loadinqs on each of the nina factors. Factor loadinqs for the factor analysis 
of instructor salf ratin7s correlated r = .90 with both the loadinqs fvou the 
fictor anilysis of student ratings in this study and tho prpv ious analysis of 
student ritinqs in all unlerqraduate courses tauqht by faculty; loadinqs froa 
the two factor nnalyses of student ratinqs correlated ,95 with each other. 
These ras'Jlts also confira the siailarity of the factor patterns rosultinq 
froB student and instructor ratinqs. 

• Converqent and niverqent. Validity 

Campbell and Fisk« (1959) advocate the assessnent of validity by 
detoraininq measures of aore than one trait, each of which is assessed by 
Boro than one aethod. Tn the present application, the aultiple traits are 
the nina ^valuation factors, while the aultiple methods refer to the two 
distinct groups of raters'-studen ts ratinq their instructor and the instructors 
ratinq theaselves. Convorqent validity, that which is most typically 
detoraiaed, is the correlation between the saae evaluation factors rated by 
two different qroups. Discriainant validity refers to the distinctiveness 
of each of the evaluation factors. Two different aspects of discriainant 
validity are particularly relevant to the present application. The first 
Qxaaines whether or not student-instructor aqreeaent on each factor is 
independent of aqreeaent on other factors. For example, if a single 
•»qeneralized ratinq factor" underlies both student and instructor 
ratinqs, then agreement on any particular factor aight be a function of 
aqreeaent on the qeneralized factor and not have anything to do with the 
specific content of the factor being considered. As a consequence, while 
correlations between stulant and instructor ratinqs on the sane factors would 
be hiqh, so would the correlations between their ratinqs on different factors. 
The seconi ispect of dissriainant validity considers the possibility that the 
relationship between diffarent factors as rated by the ^aae qroup of raters is 
due to the method of data collection rather than «true" relationships between 
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INSTFUCTOR SELP EVHUATIONS 8 
the uftdeclflnq aiaensions being considered. The aost likely source af this 
lethod variance in the present application would be a hilo affect. 

Convergent and discrioinant validity across all soursas was detecained 
by exaaininq tho correlation latrices in Table 3, The sorrelatians between 

different evaluation factors as assessed by the sane group of raters are 
contained in tha two triangular matrices— intercorrelations aaong instructor 
self evaluation factors (upper left) and student evaluation factars (lower 
right). The diagonals of those triangular aatrices contain the reliabilities 
of the f a.':tors--co8ff iciant alphas— for each group of raters. The square 
■atrlx (lower laft) contains tha correlations between student evaluation 
factors and instructor self evaluation factors. The diagonal of the square 
matrix (the convergent validity coefficients) contains corralations between 
the saae evaluation factors as assessed by the two different groups, since 
there is unreliability in both the student ratings (aedlan reliability =*,9ii) 
and partltTularly the Instructor self evaluations (median reliability s .82), 
the convergant validity coefficients have been corrected for unreliability. 
The set of aatrices in Table 3, referred to as a aultitrait- ault Inethod aatrix, 
was based upon the coabinad data of all three sets of classes— those taught at 
the qraduate and at the undergraduate levels by reqular faculty and those 
taught by teaching assistants. M ultl tralt-aultiaethod aatrices were also 
contructed separately for each of the three sets of classes (See Appendix IV), 

Insert Table 3 About Here 

Converqent validity requires that the diagonal values of the square 
aatrix be substantially higher than zero. Inspection of Table 3 Indicates 
that this was the case for all evaluation factors. Validity coefficients 
varied between .17 and .69 (aedian r-.'io) and all were statistically different 
froa zero. These finding denonstrate good support for the convergent validity 
of teacher avaluations. Convergent validity was also deterained separately 
for each of the three sets of courses (See Appendix IV and Table 4). The aedian 
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conferment validity coefficient was . a 1 for facalty-taught undergraduate 
courses. .39 for faculty- taught graduate courses, and .ne for undergraduate 
level courses taught by teaching assistants. Only four of these 27 
validity soefflcients— three of the nine validity coefi3ien^;s foi graduate 
level courses and onp of nine for courses taught by tea3hing assistants- 
failed to reach statistical significance. However this— as coipared 
to the soinhined data in which aU validity coefficients were significant— Wus 
a function of the rer?uc3i sample size rathec than lower validity coefficents; 
every validity coefficient in each of the sets of classes would have been 
statistically significant if based on the saae number of cases as in the 
combined iata. 



Insert Table H About Here 



Divergent validity is harder to assess, and Caopbell and Fislc(1959) 
offer only general guidelines. The niniaal condition is that all corr^ilations 
between different factors rated by the saie group (off-diagonal correlations in 
the triinTula.- matrices) aust bs substantially lower than the reliabilities 
of these factors. This tasts whether the different evaluation factors as 
iudged by the same group of raters are distinctive. This condition was clearly 
aet for instructor self evaluations, and even the moderate in torcorrelations 
aiong the student ratings (median r=.:i3) were much lower than the reliabilities 
of these factors (median r = .94). These same general conclusions hold when 
natrices far each of the three groups of courses were considered separately 
(See Appendix IV) . 

Campbell and Fisk( 1959) stated that "various statistical treattents foL 
■ultitrait-multiiethod matrices might be developed, but we feel that such 
summary statistics ar*? neither necessary nor appropriate at this t,\me.« 
instead, they suggest three genara] guidelines that have more intuitive appeal 
than guantitative rigor. While other researchers have attempted to develop 
more rigorous procedures, they have been only partially successful (See Mwin, 
1973) and most applications of the aultitrait-mul tiiethod procedure still rely 
on the orginal guidelines proposed by Campbell and Fislc(1959). 
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The first of their guidelines is that each convergent validity 
coefficient (diagonals of the square matrix in Table 3) should be lower than tl 
Any otheL ca;?rolation in the saae row or coluan of the square matrix. This 
test requires that each af the nine convergent validltf coefficients be hiqher 
than anr of th« 16 correlations in the saie row or column of the square latrix. 
f:>r pxaaple, tho validity coefficont for Instructor Euthusiasji was .54 (.48 if 
not corrected for unreliability), rhis was hiqher than any of the eight 
corcplations between student ratinqs of Enthusiasm and the eight other 
instructor self-ratinq factors, and was also hiqher than any of the eight 
correlations betwe9n instructor self-ratings of Enthusiasm and the eight 
other stuient rating factors. With one minor except ion-the Examinations 
trading factor failed tha tost in one of 16 comparisons-- this guideline 
was met in all cases. Inspection of the separate rautitrait- multimfithod 
matrices constructed for each of the three sets of courses also indicates 
that this test was met with few exceptions. The only evaluation factor 'that 
did not consistently demonstrate divergent validity was Examinations/Grading; 
it consistently passed this test for only graduate level courses taught 
by faculty. 

Their seaoni guiieline requires that each converqenl validity 
coefficient be hiqher than correlations between that factor and 
any other factor assessed by the same group of raters. For example, 
the validity coefficient for Enthusiasm (r « .5U) was higher than 

any correlation between student ratings of Enthusiasm and any other student 
rating, and was higher than the correlation between instructor ratings Df 
Enthusiasm and any other instructor rating. This guideline is the most 
stringent, and has several problems when applied to this particular setting. 
Its application implicitly assumes that the different factors are truly 
uncarrelatei— an assumption that seems unwarranted in this case. 
Thorndike (192 J) suggests, for example, that there should be little or 
no true correlttion between a teacher's intelligence and the quality of his 
voice, and the obtained correlation of .63 between ratings of thsse attributes 
clearly suggests a halo effect, it is not so clear that an instructOL's 
enthusiasm in teaching a course should be unrelated to student learninq in 
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the CDUCse. Trying to separate halo effect froi true ralationshlps aiong 
the underlyinq dliensions was further coiplicated by tho fact that 
the reliability of the student ratinqs was consistently higher than the 
reliability of the Instructor salf ratinqs. The higher reliabilities of the 
stulent ritinqs v\a a funstion 3f the fact that each student rating was the sea 
response froi different students, while each faculty self rating was based 
upon the response of only one individual (Sec Doyle G Crichton, 1978 and 
Harsh 5 Ovet-all, 1979). Nevertheless, if true relationships did exist 
between the different rating diiensions, then — as a consequence ot the 
higher reliabilities alone--correlations aaong the studant ratings would 
be highar than among instructor self ratings. 

Convergent validity coefficients were higher than correlations aiong 
instructor self evaluations, even when corrected for unreliability, for all 
but one fistor — Exaalnations/Grading . However, this sesond guideline was pnly 
partially satisfied when validity coefficients were coipared to correlations 
aaon g student riting factors; 23 of 72 comparisons (eight comparisons for each 
of the nine factors) failed this test and most of these were for comparisons 
iavDlving the Examinations/Srading and Organization factors, in general, these 
conclusions hold when this test is applied separately to each of the three 
sets of classes; failures of this test were more common in comparisons between 
vilidlty coefficients ani student ratings than in comparisons involving 
instructor self evaluations. Failures for instructor self ratings were 
most fraquent with the Examination/nradin g factor; failures for student 
ratings were most common with the Examination/Grading and Organization 
factors. 

Their third and final guideline is that the pattern of intercorrela tions 
aaoug different factors should be similar in both the triangular and square 
matrices. For example, there were four correlations between the factors of 
Enthusiasm and Learning; the correlations between instructor self evaluations o' 
Enthusiasm and Learning (.29--upper left triangular matrix), student ratings of 
Enthusiasm md instructor ratings of Learning (. 21 —below the diagonal of the 
sguare matrix) , instructor ratings of Enthusiasm and student ratings of Learnin' 
(.10--ibove the diagonal of the sguare matrix), and student ratings of Enthus- 
iasm and Learning (. ()5 — lower right triangular matrix). Inspection of Table 3 
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reveals that these four correlations wore aionq the highest In each af the 
rospoctlve sets of correlations. Por the nine factors there are 36 possible 
pairs of differont factors, and the relationship betveen ratings of each af 
these 36 pairs is rapresantad by four correlations (e.g., the four correlationr. 
between Enthusiasn and Learninvf described above). To test the sinilarity of th 
pattern of 3orralations aaonq the different factors, the 36 correlations aaonq 
the instructor self ratings were correlated with the corresponding 36 
correlations among student ratings. The pattern was quite similar (r = .a3, 
P< -01), implying that some of the covariation amonq factors represents a 
true relationship among the underlying dimensions rather than a siaple halo 
effect. 

An alternative approach, based upon aultiple regression was also i^sed 
to explore thr? n ulti trai t- nultiBethod matrices, in the first stage, multiple 
regression was used to pradict each instructor rating using the entire set of 
nine student ratings, and to predict each student rating yith the entire set of 
nine instructor rating factors. Por each of these 18 regressions, two aspects 
were of particular interest: 1) hov much variance beyond that explained by the 
matching variable alone could be accounted for by the entire set of nine 
variables, and 2) how auch of the variance explained by the entire set was 
uniquely due to the matrhing variable. The unigue contribution was taken to be 
the chango in multiple R squared (adjusted for the number of variables in the 
equation) due to the matching variable when it was entered separately as the 
last variable in the regrassion equation. For example, instructor ratings of 
Enthusiasn alone were able to explain 22% (before correcting for unreliability) 
of the variance in student ratings of Enthusiasm (See Table 5). The entire set 
of 9 instructor ratings was able to explain 2UX of the variance in student 
ratings of enthusiasa--an addition of only 1%. Furthermore, aost of the var- 
iance that could be explained by the entire set of nine variables was uniquely 
due to the matching variable (18X of the 2«iX) . Averaged across all nine studen' 
rating factors, the matching instructor rating variable alone could explain U% 
of the variance in student ratings, while the entire set of nine instructor 
ratings could explain 17X — an addition of only 3X. Furthermore, 13% of this 17* 
was unlguely due to the matching variable alone. Consequently, little 
variance in student ratings that was explained by student-instructor 
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aqreeient on tho sane factor could be predicted by any of the other eight 
iostructor rating factors (1X ot the laX). 

The results of the first stage of lultiple regression analyses has 
iiplications of particular interest to this study. Host iaportantly, 
virtually none of th« variance in student ratings that could be explained 
by instructor ratings on the sane factor could be explained by any sf the 
other instructor ratings; only 1% of the laX. This finding offers stroag 
support for the conclusion that student-instructor agreaaent on sach 
particular factor was distinct froa agreeaent on other factors. It also 
argues for the inportance of using aultifactor evaluation instruaents that 
have been developed witK factor analytic techniques. 

The second stage in the nultiple regression analysis was to predict 
each student rating with the eight other student ratings and the one 
■atching instructor rating, and to predict each instructor rating with the 
other eight instructor ratings and the one natching stulent rating. Foe 
«ach of these IB reqresfiions, the unique contribution of the aatching factor 
was deternined as described in the first stage. This analysis was directed 
to the issu3 of a halo effect. Specifically, how ouch variance in student 
ratings could be explainad by the renaiaing eight student factors, how 
much adiitional variance could he explained by the rating of the saae variable 
by instru:: tors , and how auch of the variance in student ratings that was 
attributable to student-instructor agreeaent on the saae factor could also be 
explained by other student factors? Averaged across all nine factors for all 
courses the other student rating factors explained 39X of the variance and the 
■atching instructor self-rating uniquely accounted for an additional 8X. 
This suggests that there is considerable covariation aaong the student 
rating factors beyond, that which can be explained by either 
student- instructor agreeaent on the saae rating factors or even the 
relationship betvesn each student rating factor and the entire set 
of instructor self rating factors (the analysis perforaed in stage one of 
the aultiple regression analysis). The saae conclusion does not hold for 
the instructor self ratings, on the average, covariation aaong the instructor 
self ratings factors accounted for only 109 of the variance within the factors, 
and the aatching student rating factor uniquely contributed an additional 
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12* to the total vacUnsa that could be explained. These findings shov that foi 
the stuient ntinqs thers is the possibility of a substantial halo effect, 
but that there is little indication of a halo effect in instructor self 
ratings. 

Tha sase nultiple regression analyses were perforned on each of the 
three sets of classes separately (See Table 5). The findings of each of 
these separate analyses were siailar to those reportod for all classes, 
la each of the three set of classes, student-instructor agreanent on the 
saae evaluation factor was reasonably distinctive fros agreeient on other 
factors, &ni aost of tha variance in the student ratings that could be 
«!cplain«d by the entire set of instructor ratings was uniguely due to the 
student-instructor aqreaasnt on the saie factors. Purthernore, there was 
evidence suggesting a halo effect in each set of student ratings, but little 
hal3 effect in any of the instructor ratings. Covariation aiong student 
ratings for courses taught by teaching assistants was soaewhat larger than 
in other courses, but more of this covariation was explicable in tens 3f 
covariation asong instructor ratings as well. 

Insert Table, 5 About Here 

Across all nine evaluation factors st udent-inst-ructor agreenent was 
generally high, but the extent of the agreement did vary considerably. In 
particular, except for graduate level courses, there was lover agreement on 
the Bxamina tions/Srading factor. Factor analyses of the student and instructor 
ratings (Sea Table 2) indicated that the dimension was well defined, and its 
reliability was comparable to the other factors(See Table U) . Examination of 
agreement on individual items (See Appendix VII) suggests the lack of good 
agreement was consistent across each of the three items designed to measure 
the factor, but was particularly marked for the item *«mathods of evaluating 
student work were fair and appropriate", the correlation for this item was 
the lowest of the 35 items and the only one that failed to reach statistical 
significance. Furthermore, the difference between mean instructor and mean 
student rating— instructor ratings were about one-guarter of a category higher 
on a five-point response scale — was also one of largest for any of the 35 
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indlvirtial iteaa. Differences ranged froa ♦0.37 (higher stuient ratings on the 
itei «Instructo' enhancod presentations with the usp of huior") to -0.27 (higher 
instructor ratings on the iten "aethods of evaluating student work were fair and 
appropriate") ; instructor self ratings were significantly higher on 6 items and 
significantly lower on 10 Iteas. 

rhe lack of good agreement on the Exa aina tiors/Giading factor is 
difficult to interpret. Results of the factor e.nalysis and the factor's 
reliability both suggest that the factor is "rcfil", and intuitively it would 
ae«a to be an iaportant aspact of teaching. Perha^js, instructors iust have 
no basis for assessing the guallty of their ex aainat ions, and the student 
ratings might he valid even though they do not agree with instructor self 
ratings. In fact, other research has shown this factor to be valid when 
the validity criterion was ob-jective student learning (Prey, 1^73; overall G 
Harsh, 1978) or student ilsposltlon towards further stu^y and application of 
the courso content ( Overall C Marsh, 1978). However, the lack of convergent 
validity leaonstrated in this study also has implications for the discussion 
of divGcgant validity as well. Each of the guidelines proposed by Campbell 
and Pisk(1959) Involves a comparison between a convergent validity coefficient 
and some other correlation coefficient. If the convergant validity for a 
factor Is low, the factor will automatically fail the divergent validity 
tests. Any correlation between this factor and other factors will appear to 
be halo effpct. in particular, comparisons Involving the 
Bxamlnatlons/lradlnq fa:;tors most freguently failed the divergent validity 
tests and contributed to the conclusion that there w as a halo effect present. 
This was true In spite of the fact that other sources suggest that at least the 
student ratings of this factor may be more valid than suggested by the lack of 
agreeaent with Instructor self ratings. 

In suaaary, several different approaches have supported both the convergent 
and at least one aspect of divergent validity of the teacher evaluations. 
The coarergent validity of the teacher evaluations— agraement between student 
and Instructor ratings on the saae factors — was corsistantly deaonstrated for 
each of the nine factors considered across all courses combined and within each 
of the three sets of courses considered separately. Student-instructor 
agreement on the sue factor was shown to be independent of agreement on 
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dlffecoof factors and could not be explained in teris of a genecali'jed 
efaluation factor that was coniiDn to both student and instructor ratings, 
thus illustratinq one aspact of diveuqent validity. Th9 question of a halo 
effect^ pirticularlv aaonq the student catinqs, Mas lore coaplica ted. rhe 
siailarity of the pittern of relationships aionq student ratinqs and the 
corcospandinq pattarn aiaonq instructor catinqs iaplies that sone of the 
covariatian aiong factors represents true relationships aaonq the underlying 
dliensionn. Pact of the elevated correlations aaong student ratinqs as 
coaparea to instructor ratings was a function of the higher reliabilities 
of the student catinqs. Pucthecaore, soae of the apparont halo 3ffect aiong 
student ritlnqs aay also be a function o£ a lack of validity in the instructor 
ratinqs; pirticularly with the Exaain ation/Rr adinq factor. However, there was 
still a strong indication that there aiqht also be soae halo effect underlying 
the student ratinqs, though probably not the instructor ratinqs. The 
existence of soae halo effect in the student ratinqs, if it does exist, does not 
undaraine either the convergent validity of ihe teacher evaluations or the 
specificity of the student-instructor aqreeaent on diffarent factors. 

DISCUSSION 

rnstructors evaluated tha effective enss of their own teachinq and were 
evaluated by their studants on the saae 35 item evaluation fora in a total 
of 3 31 different courses. The study included underqraduate and graduate 
level courses taught by faculty and undergraduate coursas taught* by 
teachinq assistants. In spite of faculty scepticisa conceminq the validity 
of student ratinqs and their belief that aany sources of potential bias do 
substantially iapast the ratinqs, there was qood student-instructor aqreeaent. 
Separata factor analyses of student and instructor self ratinqs both resulted in 
the saae set of nine evaluation factors that had been previously identified. 
This suqgests that siailar diaensions underlie both student and instructor 
evaluations. Correlations between students and instructor on the saae factors 
were generally high (aedian r » .45) and always statistically significant, whilr 
correlations between stuiant and insttuctor ratings on different factors tended 
to be low (aediin r » .00) and generally did not reach statistical significance. 
This ariues for the validity of the ratings in general, and for the 
distinctiveness of the diffarent factors. While the validity coefficients 
vere slightly lower for graduate level courses— aedian r = . 39 as opposed to 
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• U1 and .'46 for undergraduate courses taught by facultf and teaching assistants 
respecti velf— the general conclusions based upon the entire sot af courses were 
also true for eich of tha three sets of classes considered separately, rhis 
offers efldanse for the validity of student ratings at all levels of university 
teaching. 

Sevaril alternativa approaches were used to explore both the convergent 
and divergent validity of the teacher evaluations. Convergent validity, that 
which is hirpically doterained, refers to the relationship between student and 
iQstri\ctoc ratings on the saae evaluation factor. The results of the study 
offered clear support for the convergent validity of teacher evaluations. 

Divergent or discriminant validity was assessed by seeking the answers 
to two related qupstlons. First, is the stu dent-instru3 tor agreement on the 
same eviluition factors specific to that factor, or can it explained in tens 
of a generalize! agreensnt common to all the different factors? 
Second, are the correlations between the different factors as evaluated by 
faculty and students indicative of a halo effect, or do they represent true 
relationships among the underlying dimensions? The answer to the first 
gueition 'as quite clear; student-instructor agreement on the saie evaluation 
factors was specific and distinctive from other factors. While correlations 
between stuient and inatructor ratings on the same factors were uniformly high, 
correlations between thalr ratings on different fac .ors were generally low. 
Furthermore, virtually none of the variance in student ratings that could be 
explained by student-instr us tor agrcsement on the same factors could be explained 
by instructor ratings on any of the other eight factors. 

The question of a halo effect was sotieuhat more complicated. 
Correlations among the aifferent student factors (median r » . 33) were 
definitely lower thin the reliabilities of their ratings (median r - .94), 
but were higher than those among instructor self-ratings (median r = .09). 
Part of this could be explained in terms of the lower reliability of the 
instructor self ratings (median r = .82), and some of it could be explained 
in terms of a true relationship underlying see of the factors 
indpendent of the methoi of collection. Furthermore, for ratings of 
Examinations/Sriding in particular, a possible laclc of validity in the 
instructor ratings would give the appearance of an inflated halo effect in the 
stud'-t ratings, even though alternative criteria have supported the validity of 
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stiilent TAtlnqs for this factor. However, the results still suggest that 
there was at least sone halo effect in the student ratings. There was 
little evidence for any halo effect in the instructor salf ratings. 

rhroQ previous studies nost cosparahle to this investigation reported 
convergent validity coefficients of .U7 (Doyle and crichton), .62 (Webb and 
• Nolan, 195»i) anl .U9 (Marsh, Overall and Kesler, 1979). Two of these studies 
(Doyle B :richton, 1978; Narsh, Overall 8 Kftsler, 1979) also consdiered the 
dlvartrent vilidity of tha t»*acher evaluations. Doyle and crichton found little 
support for the discriminant validity of the ratings, but their study was based 
upon correlations aaonq only 10 different sections, and they considered ratings 
of individual itctns rather than evaluation factors. In the Harsh, Overall, 
and Kesler study, there was good support for both the convergent validity 
and the divergent validity of the student ratings. The results of the present 
study proviie a strong raplication of this previous finding. 

Many researchers (e.g., Hhitely 5 Doyle, 1978; Hirsh, 1078; Beatty 6 
larsh, 1975; Proy, Leonard »> Beatty, 1975; Pinkbeiner, Lathrop 6 
Schulergoc, 1971: Hildebrand, Hilson 6 Dienst, 1971; Bendig, 195U) 
have used factor analytic techniques to identify distinct dimensions 
that underlie student ratings of teaching guality. Prey (1978) recently 
argued for the existence of two distinct factors which he labeled as 
"pedagogical s)(ill" and "rapport". He demonstrated that his skill factor was 
■ore reliable and more closely related tc obiective student learning, while his 
rapport factor was correlated t'ith class size and expected grade. While Prey*s 
study (1978) did not demonstrate that there were only two factors (his factor 
factor analysis was based upon only seven items and several of these had 
substantial loadings on both his factors) , it convincingly showed that 
different components of the student ratings have guite different meanings. 

« 

Overall and narsh(1978) alsn found that some evaluation components (e.g., 
Tnstuctor BnthusiasB and Overall Instructor Rating) were more closely related to 
obiective student learning, while others (e.g., Learning/Value & Overall Course 
Rating) were more closely related to student disposition towards further study 
and application of the course content. Other findings presented in Appendix Vli 
of the present study show that for both student and instructor ratings, student 
prior subject interest was more highly correlated with Learning/Value 
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than with other factors. Siiilarly, course enrollaent was hiqhiy 
corrolat(>l with quillty of Group Interaction, but not with other factors (also 
see Harsh, Overill S Keslar, 1979b). 

Th? studies abovs aach acque for distinctive interpretations of the 
■eaninq of different student evaluation factors. Yet, in spite of this 
qrowlnq evidence to the contrary, soae users of student evaluations-- student s, 
faculty, adainistratocs, and even researchers — continue to assuae that all 
the useful infornation can be obtained froa a sinqle ratinq or slaple average 
of ratinqs. The findings of this study offer draaatic avidence that this 
is not so, and arqua for the distinctiveness of the different evaluation 
factors. Stuient-instrus tor aqreeaent on each of the nine evaluation factors 
was independent of their aqreeaent on the other factors. While there was soae 
evidence for a qeneralized factor within the student ratings, perhaps iadicativo 
of soae halo effect, it did not contribute to the specific student-instructor 
aqreeaent on the saae factors. In fact, correlations batwean student and 
instructor ratings on different factors were qenerally quite low. This 
con'jlusion argues for tha use of aultifactor evaluation instruaents that have 
been carefully onstructe I with the use of factor analytic procedures. 

Students' evaluations of taachinq effectiveness will not be useful unless 
faculty and adainist ra tors are convinced of their worth. While researchers 
have deionstratcd their reliability, validity, and relative lack of bias, 
aany faculty reraain sceptical. This scep:icisa, whether iuslified or not, 
will continue to underaine the value of the student ratlnqs until they have beer 
validated aqainst criteria that are acceptable to aost faculty, in the present 
in vestiqation, student ratinqs were validated against instructor self 
evaluations of thnlr own teaching effectiveness. This criterion, besides 
being acceptable to aost faculty, has two distinct advantages. First, it 
can b(> applied to all levals of instruction; student ratings were successfully 
validated against instructor self evaluations in graduate level courses 
and courses taught by teaching assistants as well as undergraduate 
csurses taught by faculty. Second, instructors can be asked to evaluate 
their teaching along the saae diaensions eaployed on tha student rating 
fori; in the present study it was shown that student -instru: tor agreeaent on 
any one factor was independent of agreeaent on other factors* Tn suaaary, the 
findings of this investigation establish the validity of student ratings at all 
levels of postsecondary education* demonstrate the importance of the distinctive 
evalnation factors, and should also be helpful in overcoming faculty 
j'^servations about the usefulness of student ratings. 
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Fasulty Attitudes Toward Students' ETaluations of Teachlnq Effectiveness 
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Course Level (upper 
division vs. lower) 

Students' Scholastic 
Ability Measured by GPA 

% Frosh 5 Soph Students 
in the Class 

Instructor's Age 



tions suggest that some 
the 
of 



28% Instructor ' 3 Appearance 

15% Instructor's sex 

8% Instructor's rank 

63% Instructor's popularity 

28% % Df Students Kajoring 
in A Departaent 



Student's Prior Knowledge 
of Course Content 



NOTB: Only faculty responses were included in this table. Attitudes expressed by teaching 
assistants to the first 9 items were siailar to those of facultv except that they 
expressed even stronger igpeeaeni witti the statea9nt Qndprsing th? laportaijc? of soiae 
n»^«^iir.n o' '>^^>c*iv» t-paTKfm bping nivon aore emphasis in promotional dec:isi 



isions. 
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Pi3 tor Afliil fo^9 Q 

f 4CUi 

Ifiluition lfm% (pariphriiiad) 

|lhafic«d ;c«|«Dtitioiii vitli Huaor 
OfRlAtt T«5TI»0CTOII lATtWQ 
Jft^ OHlJIItZATTOV 

Uotiir«s Paellli 

■ flWJ.TfTlHICTIOl 



Itattd Not* T«kiA9 

if opoop n 

KnoiiraqM iipr^ifiion of idoas 

rrioadlf MfAMx sindnt^s 

tRtorMtfd It Xnditlaail itudeoti 
AeeoasUU to indivliiitl studoAt? 

01ae«»9o4 Curroiit Oovolopionti 

fit IIA«tlATlpllS/6PI0lNG 
Jf«i VI**?" fAir/lopropriito 

fill ASStlVPRNtS 
fM4illOt/ro«tl filfliblO 

Atldod to roorao OnStrottnHa^ 



Coara# VorKloid Ufqh t*ft44 tTl 
Cjitrso Pico }too*fl6» - Too K 
Noars/vooii lataldo of CUsa 



«at| 



Pictor Pittorn Loiiln^a 
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INSTRUCTOR SELF EVALUATIONS 24 



Miltltralt-lteltlMthed MttrlRi Corrclatlent BttM^li'stHtfMt wttf PacHltw $«lf IvaiMtlent in All 331 Coutmi 
IW^mCTW SfUf^CVMjUATION PACTOMS 

fSCSrlS ScT«$ ***** 





(•3) 
























IITMUSIASM 


29 


(Ul 






















<MANi2AT|0N 


12 


01 


(74) 




















MXJ^ INTfllACTlOM 


01 


03 


-13 


(90) 


















INQIVIOUNj MmRT -07 


-01 


07 


02 


(02) 


















13 


12 


13 


11 


-01 


(64) 














IMNINATIONS 


-01 


00 


26 


09 


13 


20 


(76) 












ASSIQMCNTS 


24 


-01 


17 


03 


22 


09 


22 


(70) 










MMiD/DIFFlCUiTY 


03 


-01 


12 


•09 


06 


-04 


09 


21 


(70) 








snnwr 

tVMiUATION FACTORS 


INS^UC^QR SCur«EV/«jUAt|CN FACTORS 
liEAM ENTHU OMAN UROUP IMOIV BROTH EXAMS ASION NMiO 


STUOCKT EVAIiUATlON FACTWS 
UEARN ENTHU ORGAN OROF INOIV OROnt ilUm ASION 


blMIMINO/V«MC 


(40) 


10 


•01 


00 


-12 


09 


•04 


06 


02 


(93) 






INTHUSIASM 


21 


(34) 


•04 


-01 


-0? 


-01 


•03 


•09 


-09 


43 


(96) 




0MANI2ATI0N 


17 


13 


(30) 


-03 


04 


07 


09 


00 


-03 


32 


49 (93) 


• 


iwp iiniucTioN 


19 


03 


-20 


(32) 


00 


-02 


-14 


-04 


-06 


37 


30 21 (96) 




iNOiviouAu nfmmr 


03 


03 


-03 


13 


(26) 


-19 


-03 


-02 


00 


22 


33 33 42 (96) 






20 


13 


09 


00 


-14 


(42) 


00 


09 


02 


49 


34 56 17 13 


(94) 


IXAMINAT|ON$ 


ia 


09 


01 


•01 


06 


-09 


(17) 


-02 


-06 


46 


42 37 34 30 


33 (93) 


ASSIONNENTS 


20 


03 


02 


09 


-01 


04 


-01 


(43) 


12 


32 


21 34 HO 29 


40 42 (92) 


•GMjO/DiFFiCUiTY 


-00 


-03 


04 


00 


03 


-03 


12 


22 


(69) 


06 


02 -03 -03 06 


16 -02 20 



(67) 

j2injlMJI^*.?ilfSr!l!MMi!/'?^.'2!* ■•!^'"•• ♦* trl«>aul«f MtrlCM. .r. reliability 

(OMfflelMt alplia cMfflelMtt (Sm NI«, tt. 19?7). Valwi In tiM diMonal of Iomt laft Mtrlx. the touara MtrlH 

yil Shii'-SrL'iU'lin:: J'^lri?.!"*: y •~'.M.biir?r.ccor<i„s to"th; ia.^;n ►o7ai?;2;!"' 
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TAdUE 4 

REUIABIUITY AND CONVERGENT VA«jIO|ty OF STUDENT AND INSTRUCTOR SE'jF RATINGS: SEPARA-^E ANA'jYSES 
FOR UNDERGRADUATE COURSES TAUGHT BY FACUj^Y (UF— 183 OjASSES), GRADUATE 'jEVPj COURSES TAUGHT BY 
FACUjTY <GF.-45 OjASSES), UNDERGRADUATE COURSES ^auGHT BY TEACHING ASSISTANTS (TA— 103 CUASSES), 

AND COMBINED DATA FOR AU'J COURSES (COMB-331 OjASSES) 

REUIABIblTY COEFFICIENTS VAblDITY COEFFICIENTS 

INSTRUCTOR SPjF-RATINGS STUDENT RATINGS 



6VA»jUATI0N FACTORS 


UF 


GF 


TA 


COMB 


UF 


GF 


TA 


COMB 


UF 


GF 


TA 


COMB 


UEARNING/VAIjUE 


.80 


.87 


.82 


.83 


.95 


.96 


.94 


.95 


.41 


.20 


.46 


.46 


INSTR ENTHUSIASM 


.83 


.83 


.82 


.82 


.97 


.97 


.97 


.96 


.48 


.60 


.62 


.54 


ORGANIZATION 


.79 


.78 


.59 


.74 


.93 


.95 


.93 


.93 


.28 


.41 


.31 


.30 


GROUP INTERACTION 


.88 


.83 


.94 


.90 


.98 


.96 


.97 


.98 


.54 


.46 


.39 


.52 


INOIVIDUAb RAPPORT 


.82 


.81 


.83 


.82 


.96 


.97 


.95 


.96 


.17 


.31 


.52 


.28 


BREADTH OF COVERAGE 


.79 


.72 


.87 


.84 


.91 


/J6 


.94 


.94 


.43 


.06 


.37 


.42 


EXAMS/GRADING 


.77 


.74 


.76 


.76 


.93 


.90 


.94 


.93 


.15 


.39 


.15 


.17 


VAbUE OF ASSIGNMENTS 


.77 


.64 


.50 


.70 


.92 


.94 


.88 


.92 


.33 


.20 


.74 


.45 


WORWjOAO/DIFFICUjTY 


.67 




.72 


.70 


.87 


.89 


.88 


.87 


.69 








MEDIAN VA'jUE ACROSS 
ALiU FACTORS 


.79 


.78 


.82 


.82 


.93 


.96 


.94 


.94 


•41 


.39 


.46 


.45 


OVERAUU RATINGS 
(SINGjE ITEMS) 


























OVERAlj'j COURSE 


















.27 


.17 


.17 


.26 


OVERALilj INSTRUCTOR 


















.36 


.20 


.24 


.33 



« 

NOTE: RPjIABIUITY ESTIMATES, COEFFICIENT AJPHAS (SEE NIE, ET. A'j., 1977) WERE BASED UPON the 

CORREUATIONS AMONG ITEMS IN the SAME FACTOR AND COUjD NOt BE COMPUTED FOR the SINOjE ITEMS. 
VALIDITY COEFFICIENTS FOR THE FACTOR SCORES WERE CORRECTED FOR UNRPj I ABIb ITY W|th THE 
SPEARMAN BROWN EQUATION. VAblDITY COEFFICIENTS FOR THE TWO SINGjE ITEMS WERE NOT 
CORRECTED SINCE NO REUIABIbiTY ESTIMATES WERE AVAIbABbE.. IT SHOWjO BE NOTED that 
THE VAblDITY OF the two SINOjE ITEMS, THE OVER^b RATINGS, WERE bOWER THAN the 
MEDIAN VAblDITY COEFICIENTS OF THE FACTORS EVEN WHEN NOT CORRECED FOR UNREb I ABIb |ty. 
id THIS IS PROBABbY DUE tq THE FACT THAT INOIVIDUAb JTEMS TEND tq HAVE bOWER 
ERIC RPjlABIblT|ES THAN DO FACTOR SCORES thaT ARf BASED UPON SEVERAb ITEMS. 
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▼•y9ht iy ftculty (UF-183 Cln%M>. QrHuttt Utvtl Coyrtti ^y^ht ty F#cyltyJ0F-43 Clmtt); UnitfortSyttl 
CouriM TiygM ly TMchlr>g Attlttinrt CTA-IOJ cItSMS) mi cJ^\n%4 0%U For All teirtM 'oaib--33r CloMi) 



WilAtlUITY 
CoffMcUntt 



S^UOfNT 

IVMjUATION FACTOnS 



UF OF TA CM 
95 96 94 99 



97 93 97 96 

93 99 93 93 

96 97 91 

97 99 96 
96 94 94 



96 



91 



UEAftNINO/VAbUS 
KNSTR ENTHUSIASM 
ORGANIZATION 
GROUP INTERACTION 
INOIVIDUAIi RAmMT 
IRCAOTH OF COVERAGE 
EX>MS/GRA0IN6 
VAUUE or ASSIGNMENTS 92 94 66 92 
VORNjOAO/DlFFlCai^Y 67 69 66 67 
MEAN AUi 9 FACTORS 94 94 93 94 



REl^lAatUITT 
Co^fMeftnti 



93 90 94 93 



INSTRUCTOR SPiF 
EVAIjUATION factors 



GF TA COMB 
60 67 62 63 



UCARNlNG/VAIiUE 
INSTR ENTHUSIASM 
ORGANIZATION 
GROUP INTERACTION 
IWIVIOUAb RAPPORT 62 61 63 82 
BREAOTH OF COVERAGE 
E)UMS/QRAOING 
VAbUC OF ASSIGNMENTS 77 64 90 70 
HORWiOAO/DIFFlOUiTV 67 71 72 70 

>CAN AUi 9 FACTORS 79 77 76 79 



63 63 62 6? 
79 76 99 73 
66 63 94 90 



79 72 67 64 

77 73 76 76 



2 h 
r vlth Mttchln^ 
INSTR U\i Rttlng 



^ OF TA cOMi 
I2» 031 161 I6f 

161 291 301 231 

061 I3f 061 061 
291 171 I4f 201 
021 071 211 06f 
131 OOf lOS Ml 
021 101 02S 021 
061 021 241 131 
26f 29f 30f 291 
I3f I2f I7f I4f 



r tflth Matching 
STDNT Rtting 



UF GF TA COMB 
I3f 03f I6f I7S 

I6f 29S 301 231 

06f I3S 06f 06S 

29f I7f I4f 2lf 

021 07f 21 f 06S 

I3S OOf Ml US 

on I OS 021 02S 

06S 02S 24S 13S 

26S 29S 301 291 

I3S lOf I6S MS 



2 c 
Mult R vlth I 
—Hitching INSTR 
—All Otr>#r INSTR 

(UnlqM *ar Dim To 
Mttchlna Rtting) 
1/ GF TA COM 

121 021 171 161 
(09f OOf ITS 131) 

231 261 361 24f 
CMS 26f 171 161) 

091 171 091 061 
(051 171 031 091) 

301 32f 241 271 
(211 191 09f 201) 

07f OOf 27f llf 
(04f OOf I9f 07f) 

I9f OOf llf I6f 
(I3f OOf 04f llf) 

03f llf I9f 07f 
(02f 06f 03f 03f ) 

llf OOf 29f I4f 
(06f OOf I9f lOf) 

26f I6f 44f 30f 
(26f I6f leS 29f) 

I9f I2f 24S I7f 
(lOf 09S I2f I3f) 



2 c 
m\f R vlth t 
— Matching IKS-^I 
—All OthiT SW 



Mgit R WITH i 
-Mttchlng INSTR 
—All Othtr INSTR 
—All Oth«r STONT 
(Unlqui Vr Out To (UnlQyt Vv- Out To 
MttchlM Rtting) Mttchlni Rttlng) 
GF tA CX)MB ^ OF ?A COMB 



Mult R vlth I 
—Matching STQNt 
—All Othtr STDNT 

(Unlqut Var Out To 
Matching Rating) 
UF GF TA com 

llf OOf 22f I7f 
(U7f OOf 01 f 06f) 

I6f 28f 35f 29f 
(I8f 28f 27f 29f) 

I3f I4f 06f I4f 
(07f I4f 06f lOf) 

33f lOf I6f 23f 
(231^ 08f I2f 20f) 

08f I9f 22f I Of 
(04f 04f 07f 09f ) 

23f OOf 29f 2lf 
(I9f OOf I3f I6f) 

I7f 30f I3f 07f 
(09f 29f Olf 04f) 

I2f 20f 32f I8f 
(08f 06f 22f I2f) 

27f 49f 3lf 29% , 
(27f 36f 24f 26f) 



92f 49f 99f 93f 
(04f OOf Olf 031) 

46f 99f 96f 90S 

(I3f 20f I6f I6f) 

60f 99f 69f 99f 
(03f 06f 04f 09f) 

49f 29f 43f 421 

(20f 06f 09S t9f) 

39f 29f 90f 401 
(03f 03f 04f 04f) 

92f 49f 46f 90f 

UOf OOf lOf lOf ) 

92f 90f 99f 90f 
(03f 2lf OOf 021) 

41 f 46t 46f 49f 

(09f 04S I6f 06f) 

33f 49f 9lf 39f 
(29f 39f I7f 29f) 

47f 49f 92f 47f 
(09S llf 09i 06;S) 



93f 6lf 6lf 94f 
(09f OOf 03f 09f ) 

92f 61 f <4f 93f 
(I7f 2lf nf |7f) 

92f 93f 66f 61 f 
(03f 121 04f 09f) 

49f 37f 49f 49f 
(I6f 05f lOf I9f) 

4lf 29f 92f 4lf 
A)4f 02$ 09f 09f ) 

41f 96f 49S 93f 

(llf OOf lOf lOf) 

94f 46f 98f 93f 
(04f 24S OOf 04f) 

42f 90f 46f 46f 

(07f Olf I4f lOS) 

39f 96f 93f 38f 
(20f 97f t7f I3f) 

46f 90f 99f 491 
(lOf I3f 09f 09f) 



Mult R vlth I 
—Matching STTNT 
—All Othtr |»<STR 



Mult R HITH t 
—Matching STDNT 
—All Othtr STONT 
—All Othtr INSTR 
(Unlqut Var Oua To (UnlqUE VAR Oja To 
Matching Rating) Matching Rating) 
UF OF TA COMB . 



29f 36f 34f 27f 
(06f OOf I9f I2S) 

23f 33f 30f 26f 
(I4f 29f 20f I6f) 

20f 46f 06f I6f 
(02f I7f 04f 09f) 

26f 34f 26f 24f 
(22f I4f Oef 20f ) 

nf 2lf I7f I2f 
(04f 04f I7f 07f) 

22f 02f 26f I7f 
(llf OOf 05f llf) 

I8f lOf 26f I6f 
(02f 06f U2f 03f ) 

33f 27f 20f 26f 
(09f OOf I7f 08f} 

3lf 23f 29f 30f 
(24f 20f 29f 29f) 



I8f I6f 23f I8f 
(I2f I4f I3f I4f) 



23f 26f 23f 22f 

(lof lof \n i2f) 



29f 33f 39f 3lf 
(08f OOf 04f 06f ) 

26f 34f 4lf 3lf 
(I9f 33f 23f 24f) 

22f 92f I4f 2lf 
(06f I2f 06f lOf) 

39f 32S I6f 28f 
(22f 071 lOf 20f) 

20f 27f 22f I6f 
(06f 02f 07f 06f ) 

29f 06f 36f 27f 
(I7f OOf I2f I6f) 

28f 5Hf 4lf 23f 
(07f 26f OOf 07f ) 

39f 36f 34f 32f 
(06f Olf 22f I2f) 

28f 61 f 27f 30f 
(23f 93f 27f 29f ) 



27f 56f 32f 27f 
(13f 19f 13f I4f) 



t-Ral lability tttl«atat. cotfflcltnt alphas (att NIa, at. al., 1977) vara baatd upon tht corrtlttlona 
anoog IttMt vlthln tht aaat factor. 

b— Thta« ara 8l«pla blvarlata corral at loAs(uncorrtctttf for unrti labi I Ity) that hava bttn aquartd 

^'^IJllIIi!'!*^?!!!'^.^^!**^'*"^* tnttring ttch att of ItaMt slmiltantoutly-corrtettng tor tht 

dl^ul tJl !/*^?"*^.f^*^^^*"*^ ^^•^ •"♦•^^"9 ♦ht ont -.ttchlng varlabit* on tha last atap tc 

!S tMplalntd by It. if R aquarad or tht chaoga In R aquart4 

th2 IhlLi I .I^M2"'!2^^ *^ ^5* ''"^•^ varlablaa In tha aquation) It »aa conaldtrad tTbt xtro; 
Sf^iSjrMllS^ n ^jrt^I^chLiIi*i!!*2\r*^J[" ^^<* ^^ol in toaa Inatancaa thara •arayld#;ca 
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ifi:> tesrs *''''^NOix I instructor self evaluations 

' 'W use EVALUATION SERVICES 

AS A DESCRIPTION OF THIS COURSE/INSTRUCTOR. THIS STATEMENT IS: 

(MUCTlMI Bt ST Wt WNSt >0W lACH Of THI yQUCT^'NG tlMtMCNTt HAVI»Ki A Rt»OWat BLAWW ONtV ir IT l» CHAWlT NOT HlllVAMTl 
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,1 LEARNINQ: vourouNOTHt oouiiuiKriujauAuvcHAu^MiNOANOiT^ 

2 YOU HAVE LEARNED ftOMrfHINO WHICH YOO OONSlOCR VAlUAtLE 

3 VOUM iKriREST IN THf tuajfCT HAS INCRIAHO AS A OONMQUtNCC C# THtt OOUMC 

4 YOU HAVE LEARNED AND UNOtRSTOOO THf SUBJECT MATERIALS IN THIS COURSE 

; 6 ENTHUSIASM: instructor \mas ikthus4astig Atour tiachino ths oouam 

6 INSTRUCTOR WAS DYNAMIC MO ENEROCTIC IN CONDUCTINO THE COURSE 

^ 7 INSTRUCTOR ENHANCfO WtSIMTATlONS WITH THtW OP MUMOR * ' 

8 INSTRUCTORS STYLE Of PRESENTATION HELD YOUR INTEREST DURING CLAM 
0 ORQANiZATION: instructors EXnANATtONS.WlMCtiAR 

1 0 COURSE materials WERE WELL PREPARED AND CAREFULLY EXPUINED 

( 1 1 PROROSED objectives AORtKD WITH THOBI ACTUALLY TAUOHT BO VOU KNCW WHIIIt OOUIWI WIAB QOINQ 

1 2 INSTRUCTOR QAVE LECTURES THAT FACILITATED TAKING NOTES 

(13 GROUP INTERACTION; vruofwrs wire enoouaaqio to PARTioPATf in oabb oibcubbionb 

1 4 students were invited to share their ideas and knowledge 

|l 5 STUDCNTB WCRE ENCOURAQCD to ask QUBBTtONB « WIM OlVf N MBAMNOPUL ANBWBM 

1 6 STUDENTS WERE ENCOURAGED TO EXPRESS THEIR OWN IDEAS AND/OR OUESTlON THE INSTRUCTOR 

j!l 7 INDIVIDUAL RAPPORT; instructor was PRiENocYTowAmB individual BruoiKri * 

1 6 INSTRUCTOR MADE STUDENTS PEEL WELCOME IN SEEKING HELP/AOVlCE IN OR OUTSIDE OP CLASS 

^ 0 INSTRUCTOR HAD A GENUINE INTEREST IN INDIVIDUAL ITUDCNTB 

20 INSTRUCTOR WAS ADEQUATELY ACCESSIBLE TO STUDENTS DURING OFFICE HOURS OR AFTER CLASS 

1^1 BREADTH: instructor contrasted the impucations oi^ various THBomBS 

22 instructor presented the background or origin OF ideas/concepts developed in CLASS 

^3 INSTRUCTOR PRESENTCD POINTS OP VIEW OTHER THAN HIB/HER OW?. WHEN APm>PVIIATC 

24 INSTRUCTOR ADEQUATELY DISCUSSED CURRENT DEVELOPMEIVTS iN THE FIELD 

^5 EXAMINATIONS: feedback on examinationb/graocd materials was valuabli 

26 methods of EVALUATING STUDENT WORK WERE FAIR AND APPROPRIATE 

|2 7 CXAMINATIGNa/QRAOCD MATCMALB TEBTCD COURSE OONTKNT AS EMPHASIZED BY THt INSTRUCTOR 

28 ASSIGNMENTS: required readings.texts were valuable 

^ * RCAONaS. HOMOWRK. fTC CONTRIBUTCD TO APPRiaATlON AND UNOCRSTANDiNQ OP BUBJBCT 

30 OVERALL how does this course compare with OTHER COURbES YOU HAVE HAD AT USC 



33 

35 

37 
338 
39 

|W 
41 



HOW OOeS THIS INSTnUCTOM OOM^AMI WITH OTHCN lUSmjCTim VOU HAVI HAD AT MOT 

STUDENT AND COURSE CHARACTERISTICS (LEAVE BLANK IF NO RESPONSE APPLIES) 
OOUftSC DtPFtCULTY. NCLATIVE TO OTHER COURSEt, WAS (1-VERY EASY_. »«(EDIUM ... S-VCNV HARO) 
COURSE WURKLOAO. RELATIVE TO OTHER COURSES. WAS (1-VERV LIOHT... 3-MEOlUM . WERV HEAVY) 
OOUNSt MCK WAS (1>T00 UjOW.. SAaOUT MOKT.. S-ttX) fACTI 

HOURS/WEEK REQUIRED OUTSIDE OF .CLASS II 0 TO a. 21 2 TO 6. 31 6 TO 7. 4) a TO 12. Qi OVER 12 
LEVEL or INTEREST IN THE MJtCT MNOR TO THIS COURSE (l-VCRV LOW... ^MEDIUM ... SAWRV HtOH) 

OVERALL OPA AT use II BE LOW 2 S 2) 25 TO 3 0 3) 3 0 TO 34. 4) 34 TO 3 7 S) ABOVE 3 7 
LEAVE BLANK IF NOT VET ESTABLISHED AT USC 

EXPCCrCO QRAOE IN THE COURSE (1-^. M>. SC. 44. frA) 

REASON FOFI TAKINO THE COURSE 11-MAJOR REOUIRE 2-MAjOR ELECTIVE 3-OENERAL ED REOUIRE 
♦-MINOR/RELATED FIELD SOENERAL INTEREST ONLY) - SELECT THE QNf WHICH IS BEST 

YEAR IN SCHOOL 1) FRESH, titom •) JR. 4)BlC HOIMa 



MAJOR DEPARTMENT 11 SOC SCl/COMM . 2) NAT SCI/MATH . 

61 ENGINE E RING. 7) PERF ARTS. 6) PUB AFFAIRS. 



31 HUMANITIES. 41 BUSINESS. SI EDUCATION. 
9)0THER. lot UNDECLARED/UNOECIOeD 



42 
43 
44 
45 



SUPPLEMENTAL QUESTIONS (USE RESPONSES BELOW FOR INSTRUCTOR S QUESTIONS) 



.1:: 

.-|:: 
:1:: 



4f 

ERIC 



9- 

9- 



■ A: 
:;♦: 



47 
48 

49 "i" 
50 

51 "it: 



:^: 



52 

53 

54 
55 
56 



31 



57 



60 



VERY 
KX}R 


POOR 


MOO 

ERATE OOOO 


VERY 
OOOC 












































''9' 








:.i:: 






















:*! 
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^1 




































zsir: 




















isl:: 










:r1:: 














:i»: 






:n:: 










s4s: 


::»: 




ta*: 




















sa*t 




tri:: 






:s*: 
























::1:: 










z^u 


























:=*-- 




















=^ 


tut: 








:-4.: 




=1" 






























::1:: 


:.-2:: 








::1:: 
















■.■■A: 
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Amwii II. 

riCttr NtUm iMllllfl 



iMrtiiH iRlfffll it (Mftt (tAlftMlKt 

mm\ comu Mriii 

mifiitdMlIf (ii«llfii|(<VUl«lalliif 
II. IN1I«AIA)M 
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OOHVrRGfN^ ANO OiSd^iMlNAN^ V^jlOrY: CmrjAT|UNS amfFCN INS^UC^CM Srjr-rVAbUA^lONS AND SWN^ EVAbUATlONS FOR 
AliU UN3€RGRA0UArE >jCVF\i COURSES TAUCH^ 8Y TEACHING ASSISUNt^ <N«10> COURSES) 

INS^RUC^M SPj^-EV/MjUA^ION fAC^ORS 
IMS^^OR Sr^r UEARN EN^HU ORGAN GROUP INOIV BRO^H EXAMS ASiGH MCjO 
EV<SJUAT|ON FAC^Oi^S 
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INSTKXTQR SrjP-EV^UAT|ON FACT^^S STuOENT EVAUUAT|0N FACTWS 
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MOTE: Values In tht dl^gonaU o# th« upptr Uft .^nd lontr right matr lets, th« two trUngultr matrtcts, ara rtllabMlty 

(coaMiclant alpha) coaMiclanti iSae NIa, at. al., 1977). Values tn tha diagonal of lo«or laft matrix, tha squara natrU, 
ara eonvtrgant validity coaMiclents that hava baan eo^rtctad for unrtilibMlty according to the Spaarman Bro»n aquation, 
Tha nina uncorractad validity co^Mlclanti, starting ■Ith 'jaarnlng «ould ba ,40, ,55, ,23, .37, .46, .33* ,13, .49, 4 ,55, 
All corrtiatlon coafflclants art prasontad without decimal point, Corralatloni grtatar than .19 ara statistically significant. 
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V INOIVtOUAlb RAPPORT 

21:^' sr:r J:? ::!::!;. iir :S5 'fr m •»:: •»■• 

intaraatad In Individual Studanta 4 0 4 3 - 27ii 26" a Ha Va 1!. '??!! 'J' <'2 <-3 --'S" .18" 

" il?" J:! J:? :;j ;jr J:S J;l J:f 5:1 ::fj:: 

VI BRf AO^H or COVfRAC£ 

Contrasted l«pHcatloni 4.1 3,9 .12 

Gav« Background of Idtdi/Conetpts 4.1 4.0 .09 

Cava OM^arant Pointi of Vla« 4.I 4.1 ,03 

OlicutMd Currant Oavalopnicnta 4.1 4.:{ ^.OS 

Vtt fXAMINA^IONS/QRAOlNG 

(HMlnatlon Faadback Valuabia ).6 3.7 ^.06 

(val Mathods Falr/Appropr lata 3,6 4.2 •.56» 

Tastad Enphasliad Couria Contant 3.9 4.2 -.27» 

VIII AS^IGM;fNV$ 

Raadlft9i/ta4t» Valuabia 3.B 4.0 -.12 

Addad to Coorsa Undar standing 3.9 4.0 -!o6 

IX MHOjOAO/DIFFICUi^Y 

Couraa Difficulty (Ea»y-H«>d) 3.5 3.6 -.03 

Ceuraa Moriiload (U Ight-Haavy) 3.4 3.4 .00 

Couraa Paca l^oo Slo» - ▼oo Fast) 3. 1 3.0 .09 

Hours/waali Oytslda of Clasi 2*7 2,9 -•25* 

•COIAH CORRfliAtlOW FOR 35 l^fMS ,23 ,22 31 3P 

• P I a05, p i .01 

m !ol!lJs*IIi;fIl"mrVVl\^* -bsoluta agrtt-^nt (paan diffarancts that appaar under 

corral. t ions undar tha colics latalad "CORR-) sinca It .a. assuIaTtlSt't"! ;i;:l::.^s In*^ poslt.o. 



.35" 
.19" 
.14i 
.16" 


4.2 4,0 

4.1 4.1 

4.2 3.9 

4.3 4.2 


.20 
.03 
.32 
•09 


.06 
.20 
.05 
.32» 


3.6 3.4 

3.7 3.4 
3.9 3.6 
3.7 3.4 


.34»i 

.35" 

.10 

.26" 


.27ii 
.32" 
.16 
.36" 


4.0 3.6 

4.0 3.6 

4.1 4.0 
4.1 4.0 


.20»i* 
.16«« 

.06 


.35" 
.3I«» 
.16" 
.39" 


.16" 

.01 

.14» 


3.9 3.7 
4.2 4.1 
4.1 3.9 


.19 
.05 
.19 


.35»» 

:S3 


3.6 3.8 
3.6 4.0 
3.9 4,1 


-.04 

-.21*» 

-.16* 


.04 
.14 

.11 


3.7 3.7 
3.9 4.1 
3.9 4.1 


-.02 
-.27»» 


.16" 

.05 

.14»» 


.27" 
.26" 


4.2 4,2 

4.3 4.2 


.04 

.13 


.11 

.24 


3.7 3.7 
3.7 3.7 


-.04 

.02 


.51" 
.27" 


3a6 3.9 
3.9 3.9 


-.07 
-.02 


.34«» 

.32" 


.33" 
.50«» 
.13» 
.34" 


3.6 3.6 
3.6 3.6 

3.1 3,1 

3.2 3.7 


.02 
-.05 
.02 
.50" 


.36 
.45" 

.01 
.35" 


3a1 3.3 

3.2 3.1 
3.2 3.1 
2.5 2.6 


-.07 
.14* 

-.10 


• 40" 
.46" 
.02 
.24ii 


3.5 3.4 
3.4 3.4 
3.0 3.1 
2.7 2.9 


-a04 

.03 
.09 
-.24i» 


.41«i 

.50" 

.10* 

.4«.»« 
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hPPinm VI 

?2"!ft5^i?S?.?r*'^?I! JTUDEfiT/XNSTRUCTOR/COURSE CHARACTERISTICS AND STUDENT KVALUAYION.^ OF TEACHING CFFECTIVENEW (THE VALUES NOT 

IN FARCNTHCSES), AND FACULTY SELF -EVALUATIONS OF THEIR OWN lEACHINC (THE VALUES lii •ARtNTHESES) . Ntip UNDEhCR ADUATE COURSES. 

STUDENT^COURSC/INSTRUCTOR BACKGROUND VARIABLES EVALUATION FACTORS AND OVERALL SUMMARY ITEMS 

OVER OVER 

•».mP.»- .•nr.-., . ..... . . ^^^"^ ^^^^^ ^^^^^ ^NDIV BRUTH EXAMS ASICN WRKLD CRSF. INSTR 

STUDENTS RATING "STUDENT'S PRIOR SUBJECT iNTt !EST" ^^^ 25 06 3o 16 07 U 22 20 3*1 28 
S-HIsJH) ( 22) ( 20) (.OS) ( 23) (•12) (.10) (-19) (.Qo) ( 00) ( 16) ( 15) 

FACULTY RATING "STUDENT'S PRIOR SUBJECT INTEREST" 2S 21 13 16 }H Ok 09 11 .0<l 25 22 

(^•^Ok 5-HIGH) ( 36) ( 26) (-01) ( 06) ( OOJ (-2*1) (.02) ( 1 1 ) ( 03) ( 29) ( 12) 

STUDENTS RATING "COURSE WORKLOAD/DIFFICULTY" 20 06 01 Ok 06 16 Ok 23 100 26 1« 

(HIGHER SCORES DENOTE MORE DIFFICULT COURSES) (.03) (-OH) ( 03) ( 03) ( 00) ( 00) ( 21) ( 15) ( 53) ( 1?) ( 0?) 

FACULTY RATING "COURSE WORKLOAD/DIFFICULTY" 06 02 01 -03 Ok 02 05 12 5^ 1^ nfi 

(HIGHER SCORES DENOTE MORE DIFFICULT COURSES) ( 07) ( 03) ( 15) (-09) ( 10) (-Oo) ( 21) ( 21) (100) ( 29) ( I0) 

STUDENTS RATING "EXPECTED COURSE GRADE" 28 . 20 05 36 16 01 26 2k -25 26 27 
( 1 1 ) (-03) (-07) ( 17) (-10) (-1 1 ) (-1 1 ) ( 02) (-19) (.01) ( 00) 

FACULTY RATING THEIR "GRADING LENIENCY- .0*1 -lo -Oo 06 -08 -05 -05 -02 26 .Oh .10 

(l-EASY/LEMEHT GRADER. ... S-HARD^STRICT GRADEP: ( 00) ( 0*1) ( 06) ( lo) ( 1*1) ( 08) ( 32) ( 19) ( 26) ( U) ( 03) 

STUDENTS "S INDICATING INTEREST AS REASON FOR TAKING CRSE" 06 10 10 -10 -10 21 03 IK 16 09 06 

(AOTUAk PERCKNTAU) ( 09) ( 06) ( 12) (-13) (-07) ( 10) ( 10) (-06) (-12) ( 18) ( 09) 

■COURSE ENROLLMENT" .2^^ -0*1 .13 -36 -21 -09 -22 -09 -07 -16 -20 

(ACTUAL NUMBFR OF STUDENTS ENROLLED) (-02) ( 03 ) ( 10) (.413) (-17) (-03) (-03) (-1 1 ) (-oi) (.o2) (J?) 

■PERCENT OF FRESHMEN I SOPHMORES IN CLASS" -21 -1 1 -05 -36 -19 -05 -1] .10 -10 .>17 19 

(ACTUAL PERCENTAGE) • (-12) (-03) ( 15) (-27) ( 05) ( 0*1) ( 00) (-01 ) ( Ok) (-05) ( 00) 

FACULTY "NUMBER OF TIMES HAVE TAUGHT THIS COURSE" -0*1 06 10 -15 00 05 -09 -1 1 00 -02 03 

(ACTUAL NUMBER OF TIMES) • ( 05) K 09) ( 20) (-1 9) ( 15) ( ID (-03) ( 1 1 ) (-0*1) ( 03) ( 17) 

FACULTY "YEARS TEACHING IN HIGHER EDUCATION" .06 •OM -06 00 13 0*1 -10 0*1 -02 -06 -01 

(ACTUAL NUMBER OF YEARS) ( 09) (-10) ( 12) (-0*4) ( 12) ( 06) ( 0^) ( 17) ( 00) ( 05) ( 05) 

FACULTY RATING THEIR OWN "POPULARITY WITH STUDHNTS" 29 37 31 17 09 17 17 17 01 35 38 

(1. EXTREMELY UN PO PULA R . , . . 5-EXTREmELY POPULAR) ( 30 ( 3?) ( 13) ( 07) (-01 ) ( 03) (-07) ( 05) ( 02) ( 32) ( *ie) 

FACULTY RATING SELF AS "TEACHER IN UNDERGRADUATE CLASSES" 31 <I2 30 -03 00 19 16 Oo 10 31 37 

(l-WELL BELOW AV: 5-WELL ABOVE AVw) ( 30) ( M2) ( kO) (-12) ( 06) ( 0*1) ( 16) ( 13) (-05) ( 25) ( **6) 

FACULTY RATING "ENJOY TKACHING RELATIVE TO OTHER DUTIES" 25 3*1 18 22 33 00 20 09 03 29 32 

(1-EXTRMLY UNENJ0YAbLE..,5-EXTRMLr ENJOYABLE) ( 2k) ( 39) ( 01 ) ( 10) ( 12) (-21 ) (-20) ( 03) (-03) ( 15) ( 22) 

FACULTY RATING " EASE OF TEACHlKG THIS PARTICULAR COURSE" 07 -01 10 11 06 09 09 01 05 ' 03 06 

(l-VERY EASY 5-VERY DIFFICULT) (-12) (-16) (-07) ( 17) ( 12) ( 06) ( 05) ( 0*1) ( 17) (-1*1) (-10) 

faculty' RATING "SCHOLARLY PRODUCTION IN THEIR DISCIPLINE" 12 02 16 0*1 06 2t Ok 17 11 IK 16 

(1-WELL BELOW AVJ... .5.WELL ABOVE AVG) ( 28) ( 20) ( *I0) ( 09) ( 11) ( 2o) ( 25) ( 25) ( 10) ( MO) ( HI) 

FACULTY RATING "STUDENT EVAL^i ARE ACCURATE A^.SESSmEnT OF k) 36 26 27 27 I0 2*1 16 09 4»2 

TEACwiNG** (l-f^TKCNGLY DISAOJ^KK 9-riTkONGLY AGf^FE) ( 17) ( 2o) ( OM) ( 00) ( 00) (-09) (-12) (-02) ( OM) ( lo) ( 26) 

FACULTY RATIN'I •'SYUDRNT EVALfi POTENTIALLY USEFUL FEKDaACiK 27 3M 1M 29 27 05 16 08 -01 33 31 

TO FACULTY** (U?^Th0NCLT DinAGHEE. . . . 9-STKONGLV AGREE)* ( 17) ( 22) { 02) (-01 ) (-07) (-23) (-15) (*12) ( 07} ( 11) ( 15) 

correlation:. BFTWF.M J^TUOFNT and faculty ratings mi M6 26 M6 17 M3 15 33 69 27 3o 
OF THE SAHE EVALUATION SCORE.S 



NOTE: 



EACH OF THE .^ET OF 18 STUDENT/COURSE/INSTRUCTOR CHARACTER I5iT iCS WERE OBTAINF.D FROM THE STUDENT CVAL»JATION SURVEY FORM 
THE FACULTY SELF-KVALUATIOn FORM. OR THE REGISTRAR'S LISTING OF CLA^nES. EAfH OF THE.E VARIABLKS WAS THEN CORRELATED 
WITH THE 11 EVALUATIONT, OF TEACHING EFKKCTI VENKSS FACTOR SCORES AND THE TwO OVERALL !^UMMARY ITF.Mri), SEPARATE SETS OF 
COMREF.ATIONS WERE COMPUTED FOR CLASS-A VKR AGE STUDENT EVALUATIONS AND FACULTY SELF -EVALUAT IONS (VALUES IH PARENTHESES), 



NOTE: CORRELATION COEFFICIENTS ARE PRESENTED WITHOUT DFClMAL POINTS. CORRK^ATIONS GRKATKR THAN .15 ARE STATISTICALLY SIGNIFICAM 



